Control Under Arbitrary Dependence

نویسنده

  • Xu Han
چکیده

Multiple hypothesis testing is a fundamental problem in high dimensional inference, with wide applications in many scientific fields. In genome-wide association studies, tens of thousands of hypotheses are tested simultaneously to find if any genes are associated with some traits; in finance, thousands of tests are performed to see which fund managers have winning ability. In practice, these tests are correlated. False discovery control under arbitrary covariance dependence is a very challenging and important open problem in the modern research. We propose a new methodology based on principal factor approximation, which successfully extracts the common dependence and weakens significantly the correlation structure, to deal with an arbitrary dependence structure. We derive the theoretical distribution for false discovery proportion (FDP) in large scale multiple testing when a common threshold is used for rejection, and provide a consistent estimate of FDP. Specifically, we decompose the test statistics into an approximate multifactor model with weakly dependent errors, derive the factor loadings and estimate the unobserved but realized factors which account for the dependence by L1 regression. Asymptotic theory is derived to justify the consistency of our proposed method. This result has important applications in controlling FDR and FDP. The finite sample performance of our procedure is critically evaluated by various simulation studies. Our estimate of FDP compares favorably with Efron (2007)’s approach, as demonstrated by in the simulated examples. Our approach is further illustrated by some real data in genome-wide association studies. This is joint work with Professor Jianqing Fan and Mr. Weijie Gu at Princeton University. To request an interpretor or other accomodations for people with disabilities, please call the Department of Statistics and Probability at 517-355-9589.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

False discovery control for multiple tests of association under general dependence

We propose a confidence envelope for false discovery control when testing multiple hypotheses of association simultaneously. The method is valid under arbitrary and unknown dependence between the test statistics and allows for an exploratory approach when choosing suitable rejection regions while still retaining strong control over the proportion of false discoveries.

متن کامل

Estimating False Discovery Proportion Under Arbitrary Covariance Dependence.

Multiple hypothesis testing is a fundamental problem in high dimensional inference, with wide applications in many scientific fields. In genome-wide association studies, tens of thousands of tests are performed simultaneously to find if any SNPs are associated with some traits and those tests are correlated. When test statistics are correlated, false discovery control becomes very challenging u...

متن کامل

False Discovery Control Under Arbitrary Dependence

Multiple hypothesis testing is a fundamental problem in high dimensional inference, with wide applications in many scientific fields. In genome-wide association studies, tens of thousands of hypotheses are tested simultaneously to find if any genes are associated with some traits; in finance, thousands of tests are performed to see which fund managers have winning ability. In practice, these te...

متن کامل

Effect of Mindfulness-based Cognitive Therapy on Substance Dependence Intensity and Cognitive Emotion Regulation in Patients Under Methadone Maintenance Treatment

Objective: Substance dependence is the most critical biopsychosocial and legal problem. It has various harmful effects at the individual, familial, and society levels. The current study aimed to determine the effect of mindfulness-based cognitive therapy on reducing the intensity of substance dependence and improving cognitive emotion regulation in substance-dependent patients under methadone m...

متن کامل

A unifying theory of control dependence and its application to arbitrary program structures

There are several similar, but not identical, definitions of control dependence in the literature. These definitions are given in terms of control flow graphs which have had extra restrictions imposed (for example, end-reachability). We define two new generalisations of non-termination insensitive and non-termination sensitive control dependence called weak and strong control-closure. These are...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011